A De Novo-Assembly Based Data Analysis Pipeline for Plant Obligate Parasite Metatranscriptomic Studies
نویسندگان
چکیده
Current and emerging plant diseases caused by obligate parasitic microbes such as rusts, downy mildews, and powdery mildews threaten worldwide crop production and food safety. These obligate parasites are typically unculturable in the laboratory, posing technical challenges to characterize them at the genetic and genomic level. Here we have developed a data analysis pipeline integrating several bioinformatic software programs. This pipeline facilitates rapid gene discovery and expression analysis of a plant host and its obligate parasite simultaneously by next generation sequencing of mixed host and pathogen RNA (i.e., metatranscriptomics). We applied this pipeline to metatranscriptomic sequencing data of sweet basil (Ocimum basilicum) and its obligate downy mildew parasite Peronospora belbahrii, both lacking a sequenced genome. Even with a single data point, we were able to identify both candidate host defense genes and pathogen virulence genes that are highly expressed during infection. This demonstrates the power of this pipeline for identifying genes important in host-pathogen interactions without prior genomic information for either the plant host or the obligate biotrophic pathogen. The simplicity of this pipeline makes it accessible to researchers with limited computational skills and applicable to metatranscriptomic data analysis in a wide range of plant-obligate-parasite systems.
منابع مشابه
A machine learning pipeline to improve De Bruijn graph metatranscriptomic assemblies
Motivation: With the growing significance of metatranscriptomic assemblies, the need to improve their quality and maintain their controllable size has become essential. That would help in boosting all applications based on metatranscriptomic assembly. In this paper, we propose a pipeline that filters de novo assemblies while preserving or improving their quality. Original assemblies are based o...
متن کاملClustering of Short Read Sequences for de novo Transcriptome Assembly
Given the importance of transcriptome analysis in various biological studies and considering thevast amount of whole transcriptome sequencing data, it seems necessary to develop analgorithm to assemble transcriptome data. In this study we propose an algorithm fortranscriptome assembly in the absence of a reference genome. First, the contiguous sequencesare generated using de Bruijn graph with d...
متن کاملIDBA-MTP: A Hybrid MetaTranscriptomic Assembler Based on Protein Information
Metatranscriptomic analysis provides information on how a microbial community reacts to environmental changes. Using next-generation sequencing (NGS) technology, biologists can study the microbe community by sampling short reads from a mixture of mRNAs (metatranscriptomic data). As most microbial genome sequences are unknown, it would seem that de novo assembly of the mRNAs is needed. However, ...
متن کاملFunctional Profiling of Unfamiliar Microbial Communities Using a Validated De Novo Assembly Metatranscriptome Pipeline
BACKGROUND Metatranscriptomic landscapes can provide insights in functional relationships within natural microbial communities. Analysis of complex metatranscriptome datasets of these communities poses a considerable bioinformatic challenge since they are non-restricted with a varying number of participating strains and species. For RNA-Seq data a standard approach is to align the generated rea...
متن کاملIMP : a pipeline for reproducible integrated 1 metagenomic and metatranscriptomic analyses
20 We present IMP, an automated pipeline for reproducible integrated analyses of coupled 21 metagenomic and metatranscriptomic data. IMP incorporates preprocessing, iterative co22 assembly of metagenomic and metatranscriptomic data, analyses of microbial community 23 structure and function as well as genomic signature-based visualizations. Complementary use 24 of metagenomic and metatranscripto...
متن کامل